On-line Multichannel Estimation of Source Spectral Dominance
نویسندگان
چکیده
Despite its popularity, multichannel source demixing is intrinsically limited in real-world applications due to the model mismatch between the convolutive mixing model and the actual recordings. Varying number of sources, reverberation, diffuseness and spatial changes are common uncertainties that need to be handled. Post-processing is commonly adopted to compensate for these mismatches, generally in the form of non-linear spectral filtering. In this work we analyze the property of the normalized differences between the output magnitudes of a linear spatial filter. We show that thanks to the time-frequency sparsity of acoustic signals, such distributions can be approximatively modeled by a bimodal Gaussian mixture model. An on-line bimodal constrained GMM fitting is proposed, in order to estimate the posterior probability of source spectral dominance. It is shown that the estimated posteriors can be used to produce a filtered output with very low distortion, outperforming traditional non-linear methods.
منابع مشابه
Postfiltering Using Multichannel Spectral Estimation in Multispeaker Environments
This paper investigates the problem of enhancing a single desired speech source from a mixture of signals in multispeaker environments. A beamformer structure is proposed which combines a fixed beamformer with postfiltering. In the first stage, the fixed multiobjective optimal beamformer is designed to spatially extract the desired source by suppressing all other undesired sources. In the secon...
متن کاملAn Integrated Real-Time Beamforming and Postfiltering System for Nonstationary Noise Environments
In this paper, we present a novel approach for real-time multichannel speech enhancement in environments of non-stationary noise and time-varying acoustical transfer functions (ATFs). The proposed system integrates adaptive beamforming, ATF identification, soft signal detection, and multichannel postfiltering. The noise canceller branch of the beamformer and the ATF identification are adaptivel...
متن کاملDistributed multichannel speech enhancement with minimum mean-square error short-time spectral amplitude, log-spectral amplitude, and spectral phase estimation
In this paper, the authors present optimal multichannel frequency domain estimators for minimum mean-square error (MMSE) short-time spectral amplitude (STSA), log-spectral amplitude (LSA), and spectral phase estimation in a widely distributed microphone configuration. The estimators utilize Rayleigh and Gaussian statistical models for the speech prior and noise likelihood with a diffuse noise f...
متن کاملEfficient Power Spectrum estimation using prewhitening and post coloring technique
The power spectrum estimation for a multichannel autoregressive process using prewhitened and postcoloring technique, which was originally developed for a single channel, is proposed. In order to make the extension, the Cholesky decomposition of the inverse autocorrelation matrix for a multichannel autoregressive process is discussed and the autoregressive model order selection for a multichann...
متن کاملStudy on Combining Ability and Gene Effects Estimation in Some Sweet Corn Inbred Lines (Zea mays L. var saccarata) by Line × Tester Method
In breeding programs determination of gene effects and general and specific combining ability for screening of test crosses is necessary. In order to estimate the genetic variance components and the general and specific combining ability of sweet corn lines, an experiment was conducted using 8 sweet corn S6 inbred lines (including 4 maternal and 4 paternal lines) by line × tester mating design ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015